Posterior - Based Keyword Spotting Approaches Without Filler Models — Iterative Viterbi Decoding and One - Pass Approaches —
نویسنده
چکیده
This paper addresses the problem of detecting keywords in unconstrained speech without explicit modeling of non-keyword segments. The proposed algorithms are based on recent developments in confidence measures using local posterior probabilities, and searches for the segment maximizing the average observation posterior along the most likely path in the hypothesized keyword model. We can also use more complex matching scores. As known, this approach (sometimes referred to as sliding model method) requires a relaxation of the begin/endpoints of the Viterbi matching, as well as a time normalization of the resulting score, making dynamic programming sub-optimal or more complex (more computation and/or more memory). We present here alternative (quite simple and efficient) solutions to this problem. One of them uses an iterative form of Viterbi decoding algorithm, but which does not require scoring for all possible begin/endpoints. Convergence proof of this algorithm is available. Results obtained with this method on 100 keywords chosen at random from the BREF database [5] are reported. Finally, we also develop a new pruning strategy of a one-pass approach that can optimize the same criteria, and yields the same recognition performance than the iterative Viterbi approach. This report is an extension of [8], where a detailed convergence proof of the Iterative Viterbi Decoder is provided, together with an extension to the one-pass decoder.
منابع مشابه
Iterative Posterior-based Keyword Spotting without Filler Models: Iterative Viterbi Decoding and One-pass Approach
This paper addresses the problem of detecting keywords in unconstrained speech without explicit modeling of non-keyword segments. The proposed algorithm is based on recent developments in confidence measures based on local a posterior probabilities, together with the (known) approach of relaxing the begin/endpoints in a Dynamic Programming (DP) matching of partial hypothesized sub-sequences on ...
متن کاملPosterior - Based Keyword Spotting Approaches Without
This paper addresses the problem of detecting keywords in unconstrained speech without explicit modeling of non-keyword segments. The proposed algorithms are based on recent developments in confidence measures using local posterior probabilities, and searches for the segment maximizing the average observation posterior along the most likely path in the hypothesized keyword model. We can also us...
متن کاملAn Efficient Keyword Spotting Techni Language for Filler Mo
The task of keyword spotting is to detect a set of keywords in the input continuous speech. In a keyword spotter, not only the keywords, but also the non-keyword intervals must be modeled. For this purpose, filler (or garbage) models are used. To date, most of the keyword spotters have been based on hidden Markov models (HMM). More specifically, a set of HMM is used as garbage models. In this p...
متن کاملKeyword Spotting Using Durational Entropy
This paper deals with the task of detection of a given keyword in continuous speech. We build upon a previously proposed algorithm where a modified Viterbi search algorithm is used to detect keywords, without requiring any explicit garbage or filler models. In this work, the concept of durational entropy is used to further discard a large fraction of false alarm errors. Durational entropy is de...
متن کاملLexical Access-based Confidence Measure for a Spanish Keyword Spotting System
Keyword spotting deals with the search of a reduced set of keywords in audio content. Phone Lattice-based approaches are very fast but achieve poor results. HMM-based keyword spotting systems deal with filler models to absorb the Out-of-vocabulary (OOV) words and achieve best results although they are slower. We propose a technique which combines them in order to perform a confidence measure to...
متن کامل